# Low memory usage
## FastVLM-1.5B-Stage3-MNN

**License:** Apache-2.0 · **Author:** taobao-mnn · **Tags:** Large Language Model, English · **Downloads:** 1,157 · **Likes:** 1

FastVLM-1.5B-Stage3-MNN is an 8-bit quantized version of FastVLM-1.5B-Stage3 exported to the MNN format. It is a Transformer-based text generation model suited to conversational use.
## Qwen3-8B-GGUF

**License:** Apache-2.0 · **Author:** bartowski · **Tags:** Large Language Model · **Downloads:** 23.88k · **Likes:** 18

A quantized version of Qwen3-8B, produced with llama.cpp using the imatrix (importance matrix) quantization option, suitable for text generation tasks.
## EXAONE-3.5-32B-Instruct-GGUF

**License:** Other · **Author:** bartowski · **Tags:** Large Language Model, Multilingual · **Downloads:** 616 · **Likes:** 9

EXAONE-3.5-32B-Instruct is a 32B-parameter large language model supporting instruction following and dialogue tasks.
## Impish_Mind_8B-GGUF

**License:** Apache-2.0 · **Author:** bartowski · **Tags:** Large Language Model, English · **Downloads:** 532 · **Likes:** 9

Quantized versions of the SicariusSicariiStuff/Impish_Mind_8B model, produced with llama.cpp tooling in a range of quantization formats, suitable for text generation tasks.
## ESMplusplus_small

**Author:** Synthyra · **Tags:** Protein Model, Transformers · **Downloads:** 6,460 · **Likes:** 14

ESM++ is a faithful implementation of ESMC that supports batched inference and is compatible with the standard Hugging Face interface, with no dependency on the ESM Python package. The small version corresponds to the 300M-parameter ESMC model.
## FLUX.1-Lite-GGUF

**License:** Other · **Author:** gpustack · **Tags:** Text-to-Image · **Downloads:** 5,452 · **Likes:** 3

FLUX.1 Lite is an 8-billion-parameter Transformer distilled from FLUX.1-dev and optimized for text-to-image generation, reducing memory usage and improving speed while maintaining accuracy.